|
In population genetics, the allele frequency spectrum, sometimes called the site frequency spectrum, is the distribution of the allele frequencies of a given set of loci (often SNPs) in a population or sample. Because an allele frequency spectrum is often a summary of or compared to sequenced samples of the whole population, it is a histogram with size depending on the number of sequenced individual chromosomes. Each entry in the frequency spectrum records the total number of loci with the corresponding derived allele frequency. Loci contributing to the frequency spectrum are assumed to be independently changing in frequency. Furthermore, loci are assumed to be biallelic (that is, with exactly two alleles present), although extensions for multiallelic frequency spectra exist. Many summary statistics of observed genetic variation are themselves summaries of the allele frequency spectrum, including estimates of such as Watterson's and Tajima's , Tajima's D, Fay and Wu's H and . ==Example== The allele frequency spectrum from a sample of individuals is calculated by counting the number of sites with derived allele frequencies . For example, consider a sample of individuals with eight observed variable sites. In this table, a 1 indicates that the derived allele is observed at that site, while a 0 indicates the ancestral allele was observed. The allele frequency spectrum can be written as the vector , where is the number of observed sites with derived allele frequency . In this example, the observed allele frequency spectrum is . 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Allele frequency spectrum」の詳細全文を読む スポンサード リンク
|